The influence of transcript assembly on the proteogenomics discovery of microproteins

Authors

  • Jiao Ma
  • Alan Saghatelian
  • Maxim Nikolaievich Shokhirev
Abstract

Proteogenomics methods have identified many non-annotated protein-coding genes in the human genome. Many of these newly discovered genes encode peptides and small proteins, referred to collectively as microproteins. Microproteins are produced through ribosomal translation of small open reading frames (smORFs). The discovery of many smORFs reveals a blind spot for these genes in traditional gene-finding algorithms. Biological studies have found roles for microproteins in cell biology and physiology, and the possibility that additional bioactive microproteins exist drives interest in detecting and discovering these molecules. A key step in any proteogenomics workflow is the assembly of RNA-Seq data into likely mRNA transcripts, which are then used to create a searchable protein database. Here we demonstrate that specific features of the assembled transcriptome affect microprotein detection by shotgun proteomics. By tailoring transcript assembly for downstream mass spectrometry searching, we show that we can detect more than double the number of high-quality microprotein candidates. We also introduce a novel open-source mRNA assembler for proteogenomics (MAPS) that incorporates all of these features. By integrating our specialized assembler, MAPS, and a popular generalized assembler into our proteogenomics pipeline, we detect 45 novel human microproteins from a high-quality proteogenomics dataset of a human cell line. We then characterize the features of the novel microproteins, identifying two classes of microproteins. Our work highlights the importance of specialized transcriptome assembly upstream of proteomics validation when searching for short, potentially rare, and poorly conserved proteins.
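The step from assembled transcripts to a searchable protein database can be illustrated with a minimal Python sketch. It is not the MAPS implementation and is not taken from the paper. It assumes the forward strand only, ATG start codons, and a hypothetical length window (`min_aa`, `max_aa`); the function names `find_smorfs` and `to_fasta` are illustrative.

```python
from itertools import product

# Standard genetic code, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def find_smorfs(transcript, min_aa=10, max_aa=150):
    """Return (start, end, peptide) for ATG-to-stop ORFs on the forward strand.

    min_aa and max_aa are illustrative thresholds (an assumption), not
    values from the paper. ORFs lacking an in-frame stop are skipped.
    """
    seq = transcript.upper().replace("U", "T")
    hits = []
    for frame in range(3):
        peptide, start = None, None
        for i in range(frame, len(seq) - 2, 3):
            aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks ambiguous codons
            if peptide is None:
                if aa == "M":  # open an ORF at the first ATG after a stop
                    peptide, start = ["M"], i
            elif aa == "*":  # stop codon closes the current ORF
                if min_aa <= len(peptide) <= max_aa:
                    hits.append((start, i + 3, "".join(peptide)))
                peptide = None
            else:
                peptide.append(aa)
    return hits


def to_fasta(transcript_id, hits):
    """Format ORF hits as FASTA text for a proteomics search database."""
    lines = []
    for start, end, peptide in hits:
        lines.append(f">{transcript_id}|{start}-{end}")
        lines.append(peptide)
    return "\n".join(lines)


if __name__ == "__main__":
    # Toy transcript (hypothetical), searched with a lowered length cutoff.
    toy = "CCATGGCTGCTGCTTAAG"
    print(to_fasta("toy_transcript", find_smorfs(toy, min_aa=2)))
```

A real pipeline would also need to consider the reverse strand, non-ATG start codons, and redundancy between transcripts; the sketch only shows why the set of assembled transcripts determines which smORF-encoded peptides can be matched at all.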

Similar resources

The Influence of Effective Factors on Mechanical Stress on Fingertips During Snap-fit Assembly

Objectives: Nowadays, snap-fits are used in the automotive industry as a suitable alternative for mechanical joints, cabling joints, and car interior lining joints. Because of the special form of these joints, which are assembled manually, the contact area between snap-fits and the worker’s fingertips can be too small. This can cause skin pain on the worker’s fingertips. Therefore, ...

The prognostic relevance of BCR-ABL1 transcript type, Sokal score and smoke as synergestic factor with complete cytogenetic response in CML patients treated with different TKI modalities

Background: In chronic myeloid leukemia (CML), the influence of BCR-ABL1 transcript type, Sokal risk score and smoking on disease phenotype and cytogenetic response to treatment remains unknown and debated. The objective of this study was to determine the prognostic significance of transcript types, risk score and smoking status among patients with CML treated with different tyrosine kinase inh...

Engineering Nano-aggregates: β-Cyclodextrin Facilitates the Thiol-Gold Nanoparticle Self-Assembly

The structure and morphology of nanomaterials formed by colloidal synthesis are a subject of interest because they determine the physicochemical properties and biological applications of nanostructures. Among various nanoparticles, gold can develop fractal assembled patterns. Herein, we report a nano-aggregate of a thiol-on-gold self-assembled structure and the influence of β-cyclodextr...

Identification of Microprotein–Protein Interactions via APEX Tagging

Microproteins are peptides and small proteins encoded by small open reading frames (smORFs). Newer technologies have led to the recent discovery of hundreds to thousands of new microproteins. The biological functions of a few microproteins have been elucidated, and these microproteins have fundamental roles in biology ranging from limb development to muscle function, highlighting the value of c...

Regulation of protein function by 'microProteins'.

Many proteins achieve their function by acting as part of multi-protein complexes. The formation of these complexes is highly regulated and mediated through domains of protein-protein interaction. Disruption of a complex or of the ability of the proteins to form homodimers, heterodimers or multimers can have severe consequences for cellular function. In this context, the formation of dimers and...

Journal title:

Volume 13  Issue 

Pages  -

Publication date: 2018